NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

LiRa: A New Likelihood-Based Similarity Score For Collaborative Filtering

Strnadova-Neeley, Veronika; Buluc, Aydin; Gilbert, John R; Oliker, Leonid; Ouyang, Weimin (September 2016, LSRS'16: RecSys Workshop on Large-Scale Recommender Systems (10th ACM Conference on Recommender Systems:))

Recommender system data presents unique challenges to the data mining, machine learning, and algorithms communities. The high missing data rate, in combination with the large scale and high dimensionality that is typical of recommender systems data, requires new tools and methods for efficient data analysis. Here, we address the challenge of evaluating similarity between two users in a recommender system, where for each user only a small set of ratings is available. We present a new similarity score, that we call LiRa, based on a statistical model of user similarity, for large-scale, discrete valued data with many missing values. We show that this score, based on a ratio of likelihoods, is more effective at identifying similar users than traditional similarity scores in user-based collaborative filtering, such as the Pearson correlation coefficient. We argue that our approach has significant potential to improve both accuracy and scalability in collaborative filtering.
more » « less
Full Text Available
An Empirical Study of Cycle Toggling Based Laplacian Solvers

https://doi.org/10.1137/1.9781611974690.ch4

Deweese, Kevin; Gilbert, John R; Miller, Gary; Peng, Richard; Xu, Hao Ran; Xu, Shen Chen (October 2016, Proceedings of the Seventh SIAM Workshop on Combinatorial Scientific Computing)

We study the performance of linear solvers for graph Laplacians based on the combinatorial cycle adjustment methodology proposed by [Kelner-Orecchia-Sidford-Zhu STOC-13]. The approach finds a dual flow solution to this linear system through a sequence of flow adjustments along cycles. We study both data structure oriented and recursive methods for handling these adjustments. The primary difficulty faced by this approach, updating and querying long cycles, motivated us to study an important special case: instances where all cycles are formed by fundamental cycles on a length n path. Our methods demonstrate significant speedups over previous implementations, and are competitive with standard numerical routines.
more » « less
Full Text Available
Mathematical foundations of the GraphBLAS

https://doi.org/10.1109/HPEC.2016.7761646

Kepner, Jeremy; Aaltonen, Peter; Bader, David; Buluc, Aydin; Franchetti, Franz; Gilbert, John; Hutchison, Dylan; Kumar, Manoj; Lumsdaine, Andrew; Meyerhenke, Henning; et al (September 2016, Proceedings of the IEEE High Performance Extreme Computing Conference)

The GraphBLAS standard (GraphBlas.org) is being developed to bring the potential of matrix-based graph algorithms to the broadest possible audience. Mathematically, the GraphBLAS defines a core set of matrix-based graph operations that can be used to implement a wide class of graph algorithms in a wide range of programming environments. This paper provides an introduction to the mathematics of the GraphBLAS. Graphs represent connections between vertices with edges. Matrices can represent a wide range of graphs using adjacency matrices or incidence matrices. Adjacency matrices are often easier to analyze while incidence matrices are often better for representing data. Fortunately, the two are easily connected by matrix multiplication. A key feature of matrix mathematics is that a very small number of matrix operations can be used to manipulate a very wide range of graphs. This composability of a small number of operations is the foundation of the GraphBLAS. A standard such as the GraphBLAS can only be effective if it has low performance overhead. Performance measurements of prototype GraphBLAS implementations indicate that the overhead is low.
more » « less
Full Text Available

Search for: All records